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Abstract 

We consider a Markov chain on the space of (countable) partitions of the interval [0,1], obtained first 
by size biased sampling twice (allowing repetitions) and then merging the parts (if the sampled parts are 
distinct) or splitting the part uniformly (if the same part was sampled twice). We prove a conjecture of 
Vershik stating that the Poisson-Dirichlet law with parameter = 1 is the unique invariant distribution 
for this Markov chain. Our proof uses a combination of probabilistic, combinatoric, and representation- 
theoretic arguments. 
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1 Introduction 



Let f^i denote the space of (ordered) partitions of [0, 1], that is 

Ox := {p G [0, 1]^ : Pl > p 2 > ... > 0, = l} , 

where \x\i = \ x i\ for any finite or countable sequence (xi). By size-biased sampling according to 
a point p G fii we mean picking the j-th part pj with probability pj. Our interest in this paper is 
in the following Markov chain on Qi, which we call a continuous coagulation-fragmentation process 
(CCF): size-bias sample (with replacement) two parts from p. If the same part was picked twice, 
split it (uniformly), and reorder the partition. If different parts were picked, merge them, and 
reorder the partition. 

We denote by DCF( n ) (discrete coagulation-fragmentation) the Markov chain describing the evo- 
lution of the cycle lengths of permutations of {1, . . . ,n} under random transpositions. The CCF 
process appears in a variety of contexts, but of particular relevance to us is its occurrence as a 
natural limit of DCF^ n ), when n increases, see |16| for a discussion of this and its link with the 
space of "virtual permutations". 

For any n G IN denote 

V n :={£= (£i)i>i G {0,1,... .n}^ : 4 > l 2 > ■ ■ ■ > 0, \£\i = n} c nfii . 
(Elements in V n may be thought of as being of length n; the remaining entries are necessarily zero). 

A sequence i G V n is uniquely determined by its type (Nf(k) = ${i : £i=k})"_ i , with Nt=^2 Ni(k) 
denoting I's total number of parts. 

The long-time behaviour of the DCF^ ra ), viewed as an evolution in V n , is well understood. In 
particular, see e.g. [2|, it possesses a unique stationary distribution given by the Ewens formula: 

(n \ — 1 / n n \ — ^ 

k=l J \i=l k=l J 

It is well known, at least since |11 [I1H[ IT%]. that the measures TVg(n-) on Q\ converge weakly to the 
Poisson-Dirichlet distribution Jm% with parameter 9 = 1 (a precise definition of jui is given below in 
Section 12.1(1 . It has been shown in more than one way (cf . [SJ 1151 116j ) that fi\ is invariant for the 
CCF transition. This fact, and hints coming from the theory of virtual permutations, led Vershik 
(see PU) to 

Conjecture 1.1 (Vershik) jui is the unique invariant distribution for the CCF. 

Our goal in this article is to prove Vershik's conjecture. A naive approach toward the proof would 
be to use the link with the DCF^ n ) and the fact that the latter converges to the distribution tt^ 
exponentially fast. However, the rate of that convergence deteriorates with n. To overcome this 
difficulty, our strategy consists of the following steps: 



2 



1. We provide a-priori estimates (Proposition 12. 1 j) showing that every invariant distribution for the 
CCF leads to a good control on the number of "small parts". 

2. We couple the DCF^ and the CCF in such a way that whenever they start from initial distri- 
butions with such control on the tails, the decoupling time is roughly y/n (Theorem 13.1(1 

3. For initial conditions as above, and for an appropriate class of test functions, we show by using 
some harmonic analysis on the symmetric group that the DCF^ n ^ achieves near equilibrium 
before the decoupling time (Theorem 14. lj) . 

These steps are then combined in Theorem 15. II to yield the proof of Vershik's conjecture. 

Our work began from discussions with Bob Brooks on various models for "random Riemann sur- 
faces". Brooks and Makover [2j, [2] studied Riemann surfaces via a dense set of "Belyi surfaces" 
associated to three-regular graphs on n vertices with an orientation at each vertex. Their con- 
struction gives a complete Riemann surface with finite area irn for each graph. Uniformly choosing 
a random three-regular graph gives a probability distribution on Riemann surfaces, see [7] for an 
accessible account of this model. Practical choice of a random three-regular graph is not so easy 
when n is large. Brooks proposed a Markov chain method which involved splitting and joining 
cycles; investigating properties of his algorithm gave rise to the present paper. 

We next review some of the literature on this question. Tsilevich, in ^S], proves that /2i is the 
only CCF-invariant measure that is also invariant under additional symmetry conditions. Pitman, 
in ^5], proves that /2i is the only CCF-invariant measure which is also invariant under size-biased 
sampling. Related results appear in [S]. In another direction, it is shown in ^1] that fii is the only 
CCF-invariant measure that is analytic in the sense that for any k, the law of an independently 
size-biased sample (with replacement) possesses an analytic density. Finally, Tsilevich in ^7] shows 
that the law of the CCF, initialized at p = (1,0, . . .), and stopped at a Binomial(n, 1/2) random 
time, converges to jui. 

We conclude this introduction by noting that in |14j . we have introduced a slightly more general 
model of split-merge transformations, by allowing either the split or the merge operations to be 
rejected with a certain probability. An invariant measure for these generalizations is the Poisson- 
Dirichlet law of parameter 6 > 0. The discrete counterpart of this chain has been analyzed in [51 
Section 4]. While it is plausible that the techniques of the current paper can be adapted to that 
setup using the results of 5. , we do not pursue this generalization here. 

2 Continuous and Discrete Coagulation-Fragmentation 
2.1 Preliminaries and CCF 

Given a topological space W, its Borel cr-algebra will be denoted by Bw , and the space of probability 
measures on (W,Bw) by M.\{W). By a slight abuse of notations, Aii(V) will also be M.\(Wys 
subspace of probability measures whose support is contained in a given closed subset V of W. The 
total variation of a measure v is denoted by ||^|| V ar- 
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We equip Oi with its relative | • ^-topology which, on Oi, coincides with the weak (coordinatewise 
convergence) topology. 

On Qi we consider the Markov chain CCF in which two segments and pj of a given partition 
p are size-bias sampled with replacement and then, if i ^ j they merge into one of length pi + pj 
(coagulation), while if i=j, p% splits into two new parts upi, (1 — u)pi with u ~ U[0, 1] independent 
of all the rest (fragmentation) . In either case the new partition is then rearranged nonincreasingly. 

Recall that the Poisson-Dirichlet law fli is invariant for the CCF transition. Indeed, /Ii itself has 
been defined in a variety of manners ( [Tl lllj) which are well known to be equivalent. Perhaps the 
simplest is the GEM description in which segments are successively and uniformly removed from 
whatever remains of [0, 1], and then rearranged nonincreasingly. Namely, let Y\=l and for nGlN 
define X n = U n Y n , (the removed part at stage n) and Y n+ \ = Y n — X n (the remaining segment from 
which the (n + l)-th part is to be removed), where the U n 's are independent U[0, 1] variables. Since 
Y n+ i = (l — U n )Y n it follows that 1— Y n+ i = Yli=i X-i increases almost surely to 1 as n — ► oo. The 
distribution on Q,\ of the nonincreasing rearrangement (pi)i of (X n ) n is called the Poisson-Dirichlet 
law (with parameter 8 = 1) and denoted jui. 

As has been mentioned in the Introduction, it is the ultimate goal of this work to show that 
the Poisson-Dirichlet law is the only CCF-invariant probability distribution. It will be crucial for 
the main argument to establish in advance that any such invariant distribution does not put too 
much weight on very small parts: 

Proposition 2.1 Let fi £ Mi(£li) be CCF-invariant. Then 

i>l 

The proof is deferred to the Appendix. 
2.2 DCF 

In this section we formally introduce the coagulation-fragmentation chain on the discrete version 
of Oi, in which the partition points lie on a finite equidistant grid in [0, 1], or its equivalent state 
space V n , the set of integer partitions of a fixed n £ IN defined in the Introduction. It will be 
helpful to view V n as the conjugacy classes of the permutation group S n . 

The DCF^™) Markov chain on V n is defined similarly to the CCF chain on VL\. Identify each t G V n 
with a partition (Jj^i of {1, 2, . . . , n}, where for each i, denotes the cardinality of Ai, and sample 
two independent integers x,y uniformly from {1, . . . ,n} and without replacement, say x£A{ and 
y&Aj. Hi ^ j replace Ai and Aj by Ai U Aj while if i = j (in which case ii > 2 since x ^ y £ Ai) 
replace Ai by two of its subsets, consisting respectively of Ai's k smallest elements and of the ti—k 
remaining ones, where k is uniformly sampled from {1,... ,£{—1} independently of x and y. In 
either case relabel and rearrange the new A^s if necessary. 
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The transition matrix of DCF^" - ) is described as follows: To split into or merge two parts 
of different sizes j and k (l<j<k<n), let £, £' G V n be such that N v (j) = N t (j) - 1, N & {k) = 
N e (k)-1, N et {j+k)=Ni(j+k) + l and N e ,(q)=N e (q) for all q£{j,k,j + k}. Then 

KM (£,£') 

(£',£) 

To split into or merge two parts of the same size j with 2 < 2j < n let £,£' € V n and < Np(j) = 
N e (j) - 2, N t >(2j) = N e (2j) + 1, and N e (q)=N t (q) for all q?{j,2j}. Then 

K^(£,f) = j2 N e (j)(N e (j) - 1) merge 
n(n — 1) 

(£',£) = — *—N e (2j) split (2.3) 

nyn — 1) 

All other entries of the transition kernel are zero. 

It is customary to think of the representation {1, 2, . . . , n} = IJAj above as the notation for the 
conjugacy class of a permutation a £ S n . Seen this way, the DCF^ n ) transition is nothing but 
the action of a random transposition on S^s conjugacy classes. Since the random transposition's 
unique stationary probability measure is the uniform law on S n (being a finite group convolution), 
one concludes that the DCF^'s unique stationary probability measure is the one induced on S n 's 
conjugacy classes by the uniform law, namely (the Ewens sampling formula). In fact, DCF^ n ^ 
is reversible with respect to tt^\ which can also be checked directly by using (|2.2|) . ()2.3|) and (jl.lf) 
to verify the detailed balance equation K^(£, £' )^g\i) = K {n \£' ,£)Trf ] {£'). 



2jk 

"" ' N e (j)N t (k) merge 



n(n — 1) 
2{j + k) 



.N e ,(j + k) split. (2.2) 

n(n — 1) 



3 Coupling of CCF and DCF 

In order to successfully approximate a CCF chain by DCF( n ) chains as n — > oo it is necessary to 
couple them on a common probability space. 

Theorem 3.1 For all \i € M\{$1\) and a < 1/2 satisfying 

/"j>fd/i<oo, (3- 1 ) 
J i>l 

it is possible to define for all n > 1 a CCF Markov chain p(k) (k > 0) with initial distribution fi 
and a DCF^ Markov chain £{k) (k > 0) on the same probability space with probability measure 
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and expectation E^ in such a way that 



(n) 



lim Q(») 



^(0) > ^ 



lim 



p(KJ) 



/or all a < (5 and 



for all (3 < 1/2. 



(3.2) 
(3.3) 



Proof: Fix n > 1. We shall construct a Markov chain (c k ,d k ,e k ) (k > 0) on the state space 



(n) 



|(c, d, e) | c : [0, n) — > Z measurable, Z\c[[0, n)] infinite, 
d: {l,...,n} ->Z, eG{0,l}}. 



Here c and <i describe a continuous partition of [0, n) and a discrete partition of {1, . . . , n}, respec- 
tively. The interpretation of c and d in terms of elements of and V n is given by the functions 



7T r 



(n) 



Ol and Ttd : O^*, — > V n , respectively, defined by 



sort 



7r c (c, d, e) :-- 

7r d (c,d,e) := sort {($d~ v {{i})) i& ). 



/Leb (*-*({•})) 



\ n 
-I, 



and 



(3.4) 
(3.5) 



Here Leb denotes the Lebesgue measure and sort ((xj)j) is the sequence obtained by arranging the 
Xi's in decreasing order, ignoring the 0's if there are infinitely many positive Xj's. Thus two points 
x, y G [0, n) belong to the same set in the partition of [0, n) which is described by c iff c(x) = c{y). 
Analogously, x,y G {1, . . . , n} belong to the same set in the partition of {1, . . . , n} described by d 
iff d(x) = d(y). The CCF Markov chain p(k) and the DCF( n ) Markov chain £(k) will be realized as 



p(k) := 7r c (c fc , d k , e k ) and £(k) := 7r d (c k , d k , e k ). 



(3.6) 



The flag e k indicates whether the coupling between the two processes p(k) and £{k) is considered 
to be still in force (e = 0) or to have already broken down (e = 1). 

The distribution of (co, do, eo), that is the initial distribution of the Markov chain, is defined as 

the image of fj, under the function = ($J n) , <S> { ™\ 0) : fti — ► fi££ whi ch assigns to each element 
of Qi an equivalent function c and an approximating function d as follows (see Figure ^): 



^ l) (p)(x) 



*£°(p)(m) 




j'-i - 

x G [0, np 3 ) + n^pj 



$j n) (p)(m 



(x G [0,n)) 



(m G {l,...,n}). 



Thus p(0) = p and ^(0) = ^(^^^(p)) are the initial continuous and discrete partitions generated 
by p G Oi. 
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Figure 1: Constructing a continuous partition of [0,n) and a discrete 
partition of {1, . . . n} from a partition p£ flj. Here n = 8. The shaded 
area indicates the region where the continuous and the discrete number- 
ing disagree. 

Proof of (|3.2|) : To bound Afyo), the number of parts in £(0), observe that all the pieces in p of 
size less than 1/n can give rise to at most Yli n Pi^-npi<i parts (singletons) in £(Q). Therefore, 

Afyo) = #®2 (p)[{h ■ ■ ■ ,n}] = ^2^-n Pl >i + npil nPi< i 

i 

< Yl^P^Pi^ + ^PiT^i = n a Y,Pf- (3-7) 

i i 

Consequently, due to assumption (|3.1|) . 

E^[N m ]=0(n a ) (3.8) 

and hence, for all j3 > a, 

[ityo) > n' 3 ] < n-^W[iV l(0) ] = 0(n«A 

thus proving Q3.2B . 

We now define informally the kernel of the Markov chain (ck,dk,ek) with state space 
Assume that the current state of the Markov chain is (c, d, e). To compute the state (c,d,e) the 
Markov chain is going to jump to in the next step we generate four random variables £i, £2, Ci an d C2 
such that £1 and £2 and (Ci, C2) are independent of each other and of everything else and such that the 
£i are uniformly distributed on [0, n) and (Ci, C2) is uniformly distributed on [0, n) 2 \ Uj=ib' ~~ 1, j) 2 - 
The £i will serve to sample uniformly with replacement from [0, n) whereas the Q will be used to 
sample uniformly without replacement from {1, ... ,n} in case the £j have chosen the same atom 
in d twice. The new continuous partition c is then defined as follows: 

If c(&) + e(6): 

d r x) = { c (6) if c{x) = c(6) / 3 g) 

I c(x) else. 

If c(ei) = c(6): 

c(x) = { n ^ (C ' d) * C(X) = C(?l) and x > ^ (3.10) 
[ c(x) else. 
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We see that the two parts are indeed chosen with probabilities given by their size. In (|3.9|) two 
different sets, of sizes Leb(c- 1 (c({^})), i = l,2, have been selected and are merged by assigning 
the set c _1 (c({^2}); hit by £2, the number c({£i}) of the set c _1 (c({£i}), selected by £1. This creates 
a new set c _1 (c(£i)) with Lebesgue measure Leb (c _1 (c(£i))) + Leb (c (c(£2)))- 
In (|3.1U|) the set c -1 (c({£i}) = c _1 (c({£2}) is chosen twice, so it has to be split. Since £1 is 
conditionally uniformly distributed on this set we can reuse it as splitting point for that set: The 
part to the left of £1 retains its old number c(£i) = 0(^2) whereas the part to its right gets a new 
number new (c, d), which is not in the range of c or d. Note that it is always possible to find such 
a new number since Z\c[[0, n)] is assumed to be infinite. By comparing this with the definition of 
CCF given at the beginning of the Introduction we see that p(k) defined in 1)3. 6|) is a CCF Markov 
chain. 

In the discrete case, the two parts chosen are the ones containing the numbers [£1] and [£2], 
which ensures that the parts are chosen size biased. The rule for merges in the discrete partition 
is analogous to (|3.9j) : 

If d(Ril) + d(tel): 

d{m) = * d{m) = d{m) (3 . n) 

[ d{m) else. 

Here two different parts with numbers d([£i]) and ^([£2]) have been chosen. They are merged by 
giving both of them the number 

The rule for splitting is slightly more complicated. If the same part (but not the same atom) is 
sampled twice by the £j then again, as in the continuous setting, £1 determines the point at which 
the set d -1 ({r£i]}) is going to be split: The points to the left of and the points to the right 
of will constitute the two new fragments. The point itself will be attached to the left or 
the right part in such a way that the splitting rule for DCF^ n \ given in 1)2.2)1 and 1)2.3)1 . is imitated. 
This is done as follows: 

if d(r6i) = d(r&i) and [6i + real = 

new (c, d) if d(m) = and m > 



dim) 



dim) else. 



r, n A t^\t\ #d-\{dj\t{\)})^%t{\ 
orm= [61 and & < L 6J + #^1(^(^1)})-! (3 " 12) 



Indeed, consider for simplicity the case that the atoms of the set d l i{di\S,\\)}) are not scattered 
around the whole set {l,...,n}, which they typically will be, but are collected at the bottom: 
d ~ l ({ d {\£i\)}) = {!,-• • ,a}, where a := ttd _1 (M Ril )})• Definition (|3TT2|) tells us that this set is 
split into {1, . . . , j} and {j + 1, . . . , a} if 

3 ~ 1 + ^— !" <Zi<3+ ' 



1 J o-l 

Conditioned on £1 G [0, a), the probability for this to happen is l/(a — 1). This means that the 
discrete set {1, ... ,a} is indeed split as described at the beginning of Section |2~21 

If however the same atom in d has been sampled twice by the £j's, i.e. = [£2! > then £1 and 
£2 are disregarded and d(n) is defined as in ()3.11)) and ()3.12)1 but with (£1, £2) replaced by (£1, £2) in 
order to sample without replacement. The process £(k) defined in ()3.6)) is a DCFW Markov chain. 
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Figure 2: Merging the parts with numbers 1 and 2. 



It remains to define e: 




1 if Ril = [61 or c(£i) ^ d(&) or c(&) / 
e else. 



In the case e = 1 the coupling has broken down: Either the same atom in the discrete partition 
has been sampled twice by the &'s or at least one of the £j's belongs to non-corresponding sets in 
the continuous and the discrete partition. The time r := inf{fc > 1 : = 1} is regarded as the 
decoupling time of the chains p(k) and £(k). 

(n) 

The definition of the transition kernel for the Markov chain on is now complete. It is summa- 
rized in Figures [21 to 0J 

Proof of (EO|) : We denote by 

Qk '■= Feb {{x G [0, n) : c k (x) ^ d k (\x]}). 

the discrepancy between c k and d k . For k = 0, this is the roundoff error caused by the approximation 
of Co by do ; its size is the length of the shaded area in Figure ^ Note that 

Qo < %>) (3-13) 

because any part in do might disagree with cq at most in its right most atom. Moreover, can 
increase in each step by at most 1 as long as k < r: Indeed, if two parts are merged, Qk does not 
increase at all (it might even decrease) whereas it might increase by at most Leb j Ri]]) = 1 
in case of splitting. Hence, Qk+i < Qk + 1 if k < r and therefore, 

Qk < + k on the event {k < r}. (3-14) 
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Figure 3: Splitting the part with number 4 into a part with number 4 
and a part with number new (c, d) = 0. 



Since the | • |i-diameter of Oi is at most 2 we have 

t{\n?\) 



#0) 



KL^J) 



p(L^J) 



+ 2QW[r< \pP\ 



(3.15) 



We are going to bound the first term in (|3.15|) first. It is easy to see that \p— q\i > |sort (p) — sort (q)\i 
for any two summable sequences p = (pi)i and q = {qi)i of non-negative numbers. Indeed, if pi > pj 
and qi < qj, then swapping qi and qj would not increase \p — q\\. Therefore, by definitions ()3.4j) . 
(|3~5|) and (|3~5)) on the event {[n^J < r}, 

p(KJ)-^ 1 ^ 11 < ^E| Leb (<,j(W))-#^j(W) 

1 i>l 

- \ Leb ("t x : i e ^ c KJ d KJ C KJ ( x ) ^ d KJ 



i>l 



< -Q\n?\ < ~(eo + lVJ) < -(^(o) + KJ) 



n 



n 



by 1)3. 14|) and (j3.13j) . Consequently, due to (|3.8j) . the first term in 1)3. 15|) is of order 0(n a 1 x ) 
thus going to as n — > oo. 

To show that the second term in ()3.15|) goes to as well we assume without loss of generality 
that a < (5 < 1/2. Consider the probability that a chain which has not decoupled until the kth. 
step will decouple in the (k + l)th step. Given qq, . . . , g^, the event that £i samples two different 
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Figure 4: Two ways to decouple the chains: Sampling from the region 
where c and d disagree (£2, top) or sampling with replacement from d 
(£1 and £2; bottom). 

parts in ct and dk has probability Qk/n. The same holds for £2- Moreover, the event that one 
atom in dk is sampled twice, i.e. that [£1] = [£2] has probability 1/n. Therefore, the probability 
that either of these events occurs and the chain decouples is at most (2^ + l)/n. On the event 
{t > k, qq < n@} this can be bounded from above due to (|3.14|) by (2(n^ + k) + l)/n which is less 
than 5n^ _1 if k < nr. Thus we get by induction over k, 



Q^\ T > k,Q < nP] > (1 



5n /3 - 1 ) fc Q!r ) [£>o<n' 3 ] 



for all k <nP and hence 

Q<W[t> Kj 



> 



1 — 5n 



0-1 



,2/3-1 



00 < « 



(3.16) 



Due to 2/3 — 1 < 0, the first factor in (|3.16|) converges to one as n —* 00. The same holds for the 
second factor due to (|3.13|) and (|3.2j) . Consequently, also the second term in (|3,15j) goes to 0, which 
completes the proof of ()3.3|) . □ 



4 DCF^ convergence 



(n) 

It was mentioned in the Introduction, that the uniform rate of convergence to tt s is too weak 
to combine properly with n — > 00. However, according to the following theorem (to be proved in 
subsection 14.2(1 . the situation is better when starting off from partitions with relatively few parts 
and restricting our attention to a certain family C of f^i-neighborhoods to be defined below. For 
every n G IN and j3 G (0, 1], thus, denote accordingly 

Tn,f3 = G V n : N e < r/} = [l G P n : ^ = } . 
As for the definition of C, for each ZcGlN let 



I k = { (a,b) = (<H,bi 



0<a,i<bi<l y^bj<l, dfc>l-V/ 



(4.1) 
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and denote # 0) b=min |l — h > — (1 — Si=i a i)f- Then, for each (a,b) E Ik, define 

C a ,b = E ^1 : X{ e (a,i,bi) for ? = 1, . . . , fcj (4.2) 

which is nonempty if and only if < aj< min bj for % = 1, . . . , k, in which case the conditions 

1<J<* 

on (a, b) E guarantee that 

C a ,6={^ = : x' E G aib , x" E (1 — l^liJOi} . (4.3) 

h 

(Here ( • , • ) denotes concatenation and G a ^ is the (nonempty) subset of points in fli-iC ^; whose 
coordinates are nonincreasing) . Moreover, I^s definition (|4.1j) implies that 

S a ,b < \x"\i < x'k ~ S a,b , V(x',x") E C 0j6 . (4.4) 

Finally, 

C = {C a , 6 : (a,6)E4, fc > 1}. (4.5) 

The family C of ^-neighborhoods will be shown in Section|5]to be sufficiently rich to characterize /2i 
uniquely. At the same time, and as a result of their special features (|4.3|) and (|4.4j) . the convergence 
of the DCF^ n ^ to its equilibrium is fast on the sets in C: 

Theorem 4.1 Fix E (0, ±). For each n E IN /ei (A"( n )(/e)) fc>0 6e a L>CT< n ) Markov chain with 

underlying probability measure and initial distribution /ig £-Mi(P n ,/3)' T/ien /or any CeC, 
(3 1 > j3 and integer sequence k = k n > n^' 

A { r\k) := P (n) f X( n )(fc) E n<7) - 7ri n) (nC) — ► . 

V J n— +oo 



4.1 Characters in 5 n — Background 

Recall that the partition space V n can be viewed as the quotient of the permutation group S n under 
conjugacy. Thus the natural inner product on F n :={/ : V n — * M} is 

(f,9) = (f,9) n = £ /(7)ff(7)4 B) (7) • 

The fact mentioned earlier that tt^ is a reversing measure for the DCF(") means precisely that 
is selfadjoint with respect to this inner product. 

The following basic facts regarding the character theory of S n , as well as the full theory, can be 
found, for example, in ^Uj, and their relevance to random group actions (such as transpositions 
in our case) in [I] and The characters {x} of S n (traces of the irreducible representations) 
are functions on S n , constant on conjugacy classes, and as such can be seen to be functions on 
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V n . They are orthonormal under ( • , • ) and since there are j^V n of them, they are indexed by the 
partitions ( (X\)xev n ) an( ^ f° rm an orthonormal base of F n . 

bince represents a random transposition, its dual K^ n > acts on M\(S n ) as a convolution 

K^* u = ★ a ( (transposition) = — — ^ — - and otherwise] 

\ n(n — 1) J 

as a result of which, and of a corollary (|3J Ch. 2, Prop. 6]) of Schur's lemma, 



a) if( n )'s eigenfunctions are the characters {Xx)\eV n 

, v , , „(n) , . , XA( trans P° sition ) 

b) the eigenvalue t> x corresponding to X\ 1S given by (identity) — 



A result of Frobenius in principle provides formulae for all characters. Although in general they 
can be intractable, this is not so at transpositions and at the identity, thus yielding ([1J D-2,p.40]) 



Jn) 

7 \ 



^ ) p ji , j -2 J + l ) = ^ ) {±^-p^ . (4.6, 



(A's adjoint partition A' is defined below). In particular 0/™ o \ = 1 and X (n j — 1- 

For many purposes, a partition A G V n can be best described by its Young diagram Y A (Fig. |SJ), 
consisting of N\ rows of Ai, . . . , Xn^ cells respectively, in terms of which some useful features of A 
can be defined. The j'-th cell in row i is denoted 

• A' € V n is the partition whose Young diagram is obtained from A's by transposition; T , = 

• B\ = max{i : G T A } = max{i : Aj > i} (A's diagonal length) 

• R\(i, j) = {(u,v) : i<u<X'j, j v A M+ i < v < X u } (Y A 's rim segment straddled by (z,j)) 

• ^\(»>j) = \ ^a(*j j) defines Ai 4 '^ (a diagram obtained from A's by removing a rim segment 

is a Young diagram; this defines the partition A* ) 

In addition, for any 7 G "P n , define 7 r = (71, . . . , 7J., . . .) G "P n _ 7r , the partition obtained from 7 by 
removing its r-th part. The following Murnaghan-Nakayama rule (see |H1 Theorem 3.4]) provides 
a way of recursively evaluating characters: for all A, 7 £ P„ and l<r<iV 7 

x*(t)= E (-^W^ (47) 

in the sense that the sum is zero if its index set is empty, and x (0) = 1. Thus, for a fixed order 
in which 7's parts are chosen, x x (7) can be calculated by covering all possible ways of successively 
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A = (8,8,7,4,4,1,1,0,...) 
A' = (7,5,5,5,3,3,3,2,0,...) 



7 = (10,9,7,3,2,2,0,...) 

i = (6,6,4,3,3,3,3,2,2,1,0,...) 



£ A =4 



Aa(2,3) 



Al 2 ' 3) = (8,6,3,3,2,1,1,0,...) 
= (7,6,3,3,3,1,1,0,...) 



7 2 = (10,7,3,2,2,0,...) 



Murnaghan-Nakayama rule: x a (t) = A {1 , 4) (7 2 ) ~ A (2j3) (7 2 ) 



Figure 5: Young diagrams of A, 7 € V33. Two A-cells, (1,4) and (2,3), 
generate rim segments of size 9, the latter shown explicitly, which the 
Murnaghan-Nakayama rule "peels off' together with the deletion of 72. 



stripping off 7 r -sized rim segments from A's diagram, and X a (t) = if it is impossible to exhaust 
Y A entirely in this way. In particular 

iV 7 <5 A= >x A (7) = (4.8) 

since any rim segment of A contains at most one diagonal cell 



4.2 Proof of Theorem 14.11 

Before proceeding with the proof itself, it will be helpful to characterize the 7 G V n which belong to 
nC = nC a ,b for given fee IN and (a, b) Elk (assuming 0). It follows from C aj 6's description (|4,3[) 

that any such 7 can be expressed as a concatenation (7', 7") where and 7 // G7 3 n _iyi 1 , and 

where G^l consists of nonincreasing integer valued ^-sequences 7' which by virtue of (|4.4|) satisfy 

i)|V|i<n ii) 35 = 5(C)>0 such that j' k > (n-|Y|i) + Sn . (4.9) 

This state of affairs is illustrated in Figure El 
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7i 



7 



7 



7 = (Y,7" 



y* > iri> 



Figure 6: A partition 7 in nC at b splits into its first k rows 7' and the 
remainder 7" which is nonempty but smaller in size than 7"s last row. 



Proof of Theorem 14.11 Fix C G C and define f n = l n c- Then, in terms of /^q^'s density 
„M,V) _ a4"'(7) , 

50 (7) "4"'(7)- 

7 AeP„ 
and, since 0^ O)<<< ) = 1 and X(n,o,...) = h 

»(„n\ - if i\ - a( n ) k /„( n ) 



so that 



4'>C) = (/ B ,l) = ^ ,...)(5r,X ( „,o,..,)(/n,X K0 ,... ) > 

(n,0,...)^AeP„ 

» 



(4.10) 



By assumption, <?rj (7) = whenever iV 7 > ra^ 3 . On the other hand, x A (7) = whenever i?^ > 
and iV 7 < nP by the consequence (|4.8j) of Murnaghan-Nakayama's rule. Thus ()4.10(l becomes 



Ag^)= £ 'r U n) ,x A )(/n,x A > 

(n,0,...)^AGP n 



Now choose an r] such that l — {(3'—(3) < 77 < 1 and let no = S 1 -') . Then, for all n>no, 



where 



=(£ + £ + £ 



o{ n)k (ai n) ,x x )(fr 



n,X A ) 



' n 


= K(v,P) = 


{A G V n : 


#a < 


Ai 


<n-2n v , N x <n 


V" 

' n 


= K(v,P) = 


|a G V n : 


^a < nP, 


n- 


-2n r ><\ 1 <n\ 


V'" 


= K(v,fl = 


{ A G V n : 


Bx < n/ 3 , 


n- 


-2n r i<N x \ 



(4.11) 
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(Our choice of no ensures that V% and P^' are disjoint and that (n, 0, . . .) g" P^'O- It turns out that 
for n large enough, the terms in (|4.11|) vanish for all A G P" U P"', whereas when A G P^ the factor 

is sufficiently separated from 1 : 
Lemma 4.2 3n x = ni(/3,C) suc/t that {f n ,X x ) = °> Vn>m, VAgP"UP"' . 

Lemma 4.3 For a// AGP n , |#^| < ^ r/^ A and £/ms 3n 2 = n2{rf) such that 

< g-n"- 1 Vn>n 2 , VAgP^. (4.12) 

Proof of Lemma 14.21 Consider first A G P"- Now, C = C a {, for some /c G IN and (a, 6) G so 

that, as discussed at the beginning of the section and illustrated in Figure H3 7 can be split into 
(Y,Y')and 

<fn,x x )= E E ^(V^'Oxjy^'o- (4.i3) 

' <x,o 

(Note that property (|4.9t ) guarantees that the inner sum is not vacuous, i.e. IV' 1 1 > 0)- 

We shall show that for every fixed 7' G the inner sum in (|4.13jl equals zero. First apply 

Murnaghan-Nakayama's rule (|4.7j) k times to \\ (V) 7") D Y successively stripping rim segments from 
A, of lengths V at each stage i, i = 1, . . . , k. On the one hand Ai > (1— <5)n for n > n'{ = n'{((3, 77, C) 
(since A G P")> and on the other V > <^ n ; * = 1, • • • , & (by (|4.9i i)). This implies that at each of 
these k reduction stages precisely one rim segment can be stripped off, namely the last 7^ cells of 
whatever remains of Ai, i = 1, . . . , k. Summing up 

x,(V,V0=x A .(V) ( 4 - 14 ) 

where A* G V n _\^\ x is defined by A^ = Ai — |7'|i and A* = Xj, j > 2. As for the first factor of the 
summand in ()4.13|) . note that (|4.9t il implies 7^ > 7" (see Figure IHJ) and thus 

1 :R(Y). (4.15, 



4" Hy|,> (v) nU-iUjW- 

Inserting (|4.14j) and (|4,15|) in the inner sum of (|4.13|) we obtain 

E 4\i ii')x,{i = i?(7 , )(x A ,,i)„. iyil =0 

n_ 17 1 1 

since A* is not the trivial partition, that is A* 7^ (n — |V|ij 0, • • •), (because A / (n,0, .. .)), and thus 
X x « is orthogonal to X(n-|7'|i,0,-) = L 

The proof for A G P"' is similar, with n > n'" = n'"(f3,r],C), where now the only rim segments 
which can be stripped off from A are from its first column. It remains to define n\ = n'/vn'". □ 
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Proof of Lemma 14.31 Using the formula for 9 ™ given in 1)4. 6|) , 

whereas, by duality, — 0^ = offi < = Moreover, for X eV' n , 

K n) | < fl- -A- l = fl- V <e-™"" 1 as soon as "< - . 

□ 



We now continue with the estimation of 1)4.11(1 . As a result of Lemma 14.21 and Lemma 14.31 and 
recalling that k>n^', it holds for any n > niVn-2 that 

< E ^^"W.XjIK/n.Xjl- (4-16) 

To estimate the number of terms in ([4.16)1 . note that the Young diagram T A of any A G V n with 
B\ = s consists of an sxs square of cells, with (certainly no more than n — 1) cells added to each 
one of the square's s rows and s columns. Ignoring the various additional constraints, there are n 2s 
ways of making such additions, and thus for any t > 0, ^{X&Vn : B\ < t} < tn 2t , so that 

#V' n < #{XeV n : B x < n?} < n^n 2nP < e 3n0lo ^ n . 
As for the terms in ([4.16)1 . \{fn,X\)\ < 1 by the Cauchy-Schwartz inequality, and 

sup \{9^,X X )\ < sup | XA ( 7 )| < n nf) = e nn °z n 

-yET n :N- f <n 13 

where the second inequality follows from applying Murnaghan-Nakayama's rule at most vP times, 
each time with not more that n terms in the sum (|4.7jl . 

The above and 1)4. 16j) imply that for all n>max{ no, 1-1,^2} 

< exp { -n? ( n W-P)-£-v) - 4 log n) } . 
_j>_ 

< e 2 ; which concludes the proof of Theorem 14.11 . □ 



A™(fc) 



Eventually, thus, A^'\k) 
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5 Proof of Vershik's conjecture 



This section is devoted to the proof of Coniecture ll.il which we restate as 

Theorem 5.1 If fx G is CCF-invariant then fi is the Poisson-Dirichlet measure fix. 

The main ingredients in its proof have been established in Sections 121 El an d 0]and are, respectively, 
the a priori finite moment estimate Proposition 12.11 the couplings with approximating DCF^'s 
of Theorem 13.11 and the fast convergence to equilibrium of the DCF^™) chains in the sense of 
Theorem 14.11 

Proof of Theorem I5.lt 

Let fij = {p G Q\ : 3 infinitely many n G IN such that p n > J2j> n Pj}- We- shall show that 

= 1 (5.1) 
{C n flj : C G C} C Bq> is measure determining on (Vl' 1} B^) (5.2) 

/x(C)=/2i(C) VCGC (5.3) 

which together imply in particular that = 1, and indeed the theorem's statement as well. 

Proof of (|5.1|h Recall jui's description as the law of the nonincreasing rearrangement of the 
uniform stickbreaking process X n (with Y n = 1 — ^2j <n Xj the remaining stick length prior to the 

n-th break and X n = U n Y n ) and define ti=1, Tk+i= min{n>T£; : X n A(Y n — X n ) < Xj, Vj<Tk} 
for fe>l. Since a.s. X n \0, each r k is finite. We claim that 

A k :={U Tk >^} = {X Tk > Xj} are independent, P(A k ) = - \/k . (5.4) 

j>T k 

This implies that a.s. U Tk > ^ infinitely often, and these n = r k will be the ones alluded to in f^'s 
definition. Indeed, on A k ^2 j>T Xj <Xi \/i < r k , so that the nondecreasing permutation of the 

Xi's decouples on [l,Tfe] and (r k ,oo) and thus p Tk = minj< Tfc Xj > J2 3>T Xj = Yl j>T Pj- 

C y n if 77 n = 1 

To prove (|5.4j) . represent the splitting variables as U n = < n T/ . f _ n , where V n ~ U[0, 1] 



1-K ifr/ n = 

c c 

and i] n ~Bernoulli(0.5) are independent of each other, and write A k = {B k f] C k ) U [B k f] C k ), with 
B k = {V Tk >\] and C fc = {r/ Tfe =l}. Ther fc are J^-stopping times, where T n = a(Vi, . . . , V n , r/ , . . . ,ry n _i) 
(arbitrarily set 770 = 1) so that i?^ G and is independent of T Tk (in particular P(C k ) = 0.5) 
for all k. For any D G T Tk 

P(DnA k ) = P({DC\B k )C\C k ) + P({Dr\B C k )C\C C k ) 

= P(DnB k )P(C k ) + P(DnB k )P(C k ) = ±P(D) . 
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Choosing first D = Q\ and then D = Dj^jAj with J C {1, . . . , k — 1} (indeed, Aj G ^v,-+i C ^Vj, 
for j <k), we respectively obtain P(A/ t ) = 0.5 and the independence of the ^4 n 's. We have thus 
proved Q5.4JI and thus <|5.1|) . 



Proof of (|5.2|) : Fix e>0, pGf^, and choose k large enough so that 0<q-=^2 >k Pj < ^ m ^ xl (Pk > §)• 
Then let (5 = gA ^ 2 "' ? ^ an< ^ a i = Pi~^ h=pi+5 for i = l, . . . , fc. We claim that (a, b) G/fc. Indeed, 

y^&i = y^(pj + 5) < 1 - c? + ^ -q < 1, whereas 

i<fc i<fc 

a fc + ^ a, = (p* ~ 5 ) + - <5) = (pit + 1 - g) - (fc + 1)* > l • 

j<fc i<fc 

By definition p G C ai 6. Moreover, for any x G C aj b 

k oo 

|_ T , _ „,l +9 \" „, < 9^ 4- 9o < 9(1 + 

fc + 1 



p|i< 2^ \x j -pj\ + 2 Pj < 2kS + 2q < 2(1 + -j^-j)q < e 
3=1 j=k+l 



which shows that for any open Zi-ball B e (p) in there is some CgC such that pGCflO^ C B e (p). 
In other words, {C HO^, C G C} generates O^'s topology. 

To conclude the proof of Q5.2|) we need to check that C is closed under intersections. For any j < k 
then, let (oi, bi) G Ij and (a2, ^2) £ A:> an d if j < k denote a\ i = and b\ . = 1 for i=j + 1, . . . , k. 
It follows immediately that (a, b) defined by aj = ai. V 02^ and bi = b\. A b%. for z = 1, . . . , k 
belongs to I k , and C ai>bl n C a2if , 2 = C aib . 

Proof of (|5.3|) : First note that if (a, b) G Ik then ((l-fe)a, (1— e)b) G Ik for all e in some neighborhood 
of 0, and if C := C a b 7^ 0, then so is C( e ) := C(i+ e ) a ,(i-e)6 f° r an e m a neighborhood of 0. 

Once we show that for all e > small enough, 

let e\0 and use jui(SC) = to obtain //(C) = /2i(C) for every CgC, thus proving (f5.3|) and with 
it the theorem. 

Let |<a</3<7<^be three otherwise arbitrary numbers. Since by Proposition l2.H u satisfies , 

we consider for every nG IN the probability measure = Q^/ introduced in Prop osition 13 . 1 1 which 
is defined on a space which supports both a CCF Markov chain p(-) with \i as its stationary marginal 
and a DCF^ n ^ Markov chain £^ n \-) which "emulates" p(-) in terms of its initial law (cf. (|3.2|l ) and 
in the sense that they remain close after n 7 units of time (cf. Q3.3|) ). For any n G IN, 

^C)-MC {£) ) = Q {n \p{[n-<\)€C)-MC {£) ) 

> (p( [n'\ ) G C) - (± £^ ( Ln 7 J ) G 

_ g( n ) ^i£W(Ln T J) G C (e) ) -7r (n) (nC (e) )| + (V n )(nC^) - /Ii(C^) 
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The first term is estimated using a simple union bound with e' := eminjaj : 1 < i < k }, and (j3.3|) : 



-Q 



(n) 



p(L» 7 J)-^ (b> (Lt» 7 J) 



QOO 



p(K j)_I^) (Ln 7j ) 



-> . 



1 n—*oo 



To estimate we would like to apply Theorem 14. II to the sequence of discrete processes £^ n \-). 
Their initial laws, however, are guaranteed by ()3.2j) to be only nearly supported on V n a, respectively, 

but not totally as required by Theorem EJ Define thus Q (n) (-) := Q [n) ( ■ I ^ (n) (0) E V n> p) ; 

obviously Q(")(^ (n) (0) E P„ i/3 ) = 1, and under Q (n) , remains a DCF^ chain. Then, 



Df < 



Q(n) ^)(L„7j) g nC -( B ) , \ _ + ||Q (n) - Q (n) | 



->■ 0. 



Here we applied Theorem O for the first term, while \\Q {n) - Q (n) || var < q(I) ff) jojgp"'^ ~> 
by p . 

Finally, recall that 7ro^(n«) — > /2i weakly ([131 HE!), and since C( £ ) satisfies /2i(dC( e )) = 0, it follows 



that limvr^ (nC (e) ) = £i(C( s )). Thus Urn Dg } = 0. Consequently 



n— >oo 

//(C)-/2i(C^) > lim^P - En.D< e) + limZ^ £) > . 



The reverse inequality Jli{C^ e -*) > //(C) is obtained similarly from 



Mi 



(c ( - 



//(C) > -L>j~ e) - D 



□ 



6 Appendix 

Proof of Proposition 12. ll Consider the partition of (0,1] by J n := (2~ n ~ 1 ,2~™] (n > 0) and 
define on f!i the random variables 

W n := ^2pil pt>2 -n (n > 1). 

%>l 

Fix n > 1. If two intervals are merged then W n can only increase and if some interval is split then 
W n can only decrease. We call the increment in the case of merging A + > and the loss in the 
case of splitting A_ > 0. Given p, we can bound A + by 

A+ > J2pf Pj l pi£ J n l pj>2 -n-l 




and compute A_ as 
A_ 



x) Pt <2-"< Pt 



y^Pi / Xpil xp .< 2 -n <pi + (1 - X)p i ll 1 

i Jo 

2^^ 1 2-"<p l f xl x<2-*/ Pi dx = 2^p, 3 l 



(It 



i - L 2-™<p i 



ar 

T 







2-"< Pi - 



Therefore, 



A_i_ - A_ > 2 



-2n 



> 2 



-2n 



2 -n-l <pj 



1 • 



Since J A + — A_ dfi = due to stationarity this implies 



4 > 



> 



, i 1 Pi>2- 



dfj, 



/(Sw.) (x>a 

2 ~"~ 1 /(ew.J ^ > 2-"-M y E w. d A ■ 

Since this holds for any n > 1 we get 

with /3 = 1/2. Therefore, for any a > (3 = 1/2, 

/l>? < E 2 ~ Qn / E w» ^ < cE 2n(/3_Q) < oo» 



(6.1) 



(6.2) 



(6.3) 



for some constant c > 0, thus proving (J2.1|) for a > 1/2. We shall now use this result to extend 
it to all 2/5 < a < 1, as required. To this end, observe that we have due to (|6.1|) for arbitrary 
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</?< 1, 



4 > 



-„-i < 2- n/3 



> 2 



> 2 



-n/3 



E 1 p» ej " d ^ 



2 -n/3 



E^ 1 



Pi>2- 



i < 2 



-n/3 



(6.4) 



To bound the //-probability in the last expression we recall from (|6,3j) that for 1/2 < a < 1, 



oo > 



Erf ^ / ferfj 1 { v * : ^ ^ 2_n/3 } ^ 



(6.5) 



for any n > 0. On the event {Vi : p-i < 2 ra ^}, by Jensen's inequality, 



a-l 



Erf = E» -rf" 1 ^ E» ■» ^ 2n/3(1 ~ 



a) 



Therefore, for all 1/2 < a < 1 due to p[Vi : Vi < 2" n/3 ] = O^-^ 1 -")) as oo. 

Substituting this into Q6.4|) we get that for any 1/2 < a < 1 



/3+n)-n/3(l-a)N 



0(2 



nmax{/3,l-/3(2-a)}N 



The choice /3 = (3 — a) 1 minimizes max{/3, 1 — /5(2 — a)} and therefore yields that (|6.2I) and 
consequently also flE2) and (J2HJ hold for any a,/3 > (3 - 1/2)" 1 = 2/5. □ 

Remark: A posteriori, once it has been established that [i must be the Poisson-Dirichlet law, 
Proposition 12.11 holds for all a > since by |11| (20)] 



Erf^ 



j>i 



i _ i 

i — / x a 1 dx = — 

o « 



Note added in proof: In [13], the question was raised as to whether the state s := (1, 0, . . .) £ fii 
is recurrent for the CCF. Our techniques allow one to respond affirmatively to this question. Indeed, 
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let X( n \k) denote the state, at time k, of a DCF( n ) initialized at X^(0) = (n, 0, . . .) =: s n . The 
recurrence of s for the CCF then follows, by the coupling introduced in Theorem 3.1, from the 
existence of a constant C independent of n and k < n such that P^ n \X^ (2k) = s n ) > C/k. To 
see the last estimate, note that by the character decomposition at the beginning of the proof of 
Theorem 4.1, it holds that 

p<»)(xW(2*) = ,„)= £ ^ )2fc ((4 n) (^))- 1 i s „,x A )(i s „,x,)= E WWx» J W. 

From (1.1), 71"^ = 1/n. By the Murnaghan-Nakayama rule, x A (s n ) = unless A = (i, 1, 1, . . .) 

for some i G {1, . . . , n}, in which case |Xa( s ™)I = 1- Using (4-6), one has that for such A, 6^ = 
(2»-n-l)/(n-l). Thus, 

i=l V 7 i=l V 7 i=l 

where we used that (1 — x) > e~ x /2 for x < l/y/2. This yields the claim. 
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